The Glicko rating system and Glicko-2 rating system are methods of assessing a player's strength in games of skill, such as

chess Chess is a board game for two players, called White and Black, each controlling an army of chess pieces in their color, with the objective to checkmate the opponent's king. It is sometimes called international chess or Western chess to dist ...

and Go. The Glicko rating system was invented by Mark Glickman in 1995 as an improvement on the

Elo rating system The Elo rating system is a method for calculating the relative skill levels of players in zero-sum games such as chess. It is named after its creator Arpad Elo, a Hungarian-American physics professor. The Elo system was invented as an improved c ...

, and initially intended for the primary use as a

chess rating system A chess rating system is a system used in chess to estimate the strength of a player, based on their performance versus other players. They are used by organizations such as FIDE, the US Chess Federation (USCF or US Chess), International Correspo ...

. Glickman's principal contribution to measurement is "ratings reliability", called RD, for ''ratings deviation''.

Overview

Mark Glickman created the Glicko rating system in 1995 as an improvement on the

. Both the Glicko and Glicko-2 rating systems are under public domain and have been implemented on

game server A game server (also sometimes referred to as a host) is a server which is the authoritative source of events in a multiplayer video game. The server transmits enough data about its internal state to allow its connected clients to maintain their ...

s online (like Pokémon Showdown,

Pokémon Go ''Pokémon Go'' (stylized as ''Pokémon GO'') is a 2016 augmented reality (AR) mobile game, part of the ''Pokémon'' franchise, developed and published by Niantic in collaboration with Nintendo and The Pokémon Company for iOS and Android ...

, Lichess,

Free Internet Chess Server The Free Internet Chess Server (FICS) is a volunteer-run Internet chess server. It was organised as a free alternative to the Internet Chess Club (ICC), after that site began charging for membership. History The first Internet chess server, ...

, Chess.com, Online Go Server (OGS), Counter Strike: Global Offensive,

Quake Live ''Quake Live'' is a first-person arena shooter video game by id Software. It is an updated version of ''Quake III Arena'' that was originally designed as a free-to-play game launched via a web browser plug-in. On September 17, 2014, the game wa ...

Team Fortress 2 ''Team Fortress 2'' is a 2007 multiplayer first-person shooter game developed and published by Valve Corporation. It is the sequel to the 1996 ''Team Fortress'' mod for '' Quake'' and its 1999 remake, '' Team Fortress Classic''. The game was ...

Dota Underlords ''Dota Underlords'' is a 2020 auto battler game developed and published by Valve. The game is based on a popular ''Dota 2'' community-created game mode called ''Dota Auto Chess'', which was released in January 2019. ''Dota Underlords'' first rel ...

Guild Wars 2 ''Guild Wars 2'' is a free-to-play, massively multiplayer online role-playing game developed by ArenaNet and published by NCSoft. Set in the fantasy world of Tyria, the core game follows the re-emergence of Destiny's Edge, a disbanded guild ded ...

Splatoon 2 is a 2017 third-person shooter game developed and published by Nintendo for the Nintendo Switch. It was released on July 21, 2017, and is a direct sequel to ''Splatoon'', which includes a new story-driven single-player mode and various onl ...

, Dominion Online, TETR.IO, and competitive programming competitions. The Reliability Deviation (RD) measures the accuracy of a player's rating, where the RD is equal to one standard deviation. For example, a player with a rating of 1500 and an RD of 50 has a real strength between 1400 and 1600 (two standard deviations from 1500) with 95% confidence. Twice (exact: 1.96) the RD is added and subtracted from their rating to calculate this range. After a game, the amount the rating changes depends on the RD: the change is smaller when the player's RD is low (since their rating is already considered accurate), and also when their opponent's RD is high (since the opponent's true rating is not well known, so little information is being gained). The RD itself decreases after playing a game, but it will increase slowly over time of inactivity. The Glicko-2 rating system improves upon the Glicko rating system and further introduces the ''rating volatility'' σ. A very slightly modified version of the Glicko-2 rating system is implemented by the

Australian Chess Federation The Australian Chess Federation (ACF) is dedicated to promoting the game of chess in Australia, and is a member of FIDE, the World Chess Federation. The ACF administers its own chess rating system for tournaments in Australia, and runs the ACF ...

The algorithm of Glicko

Step 1: Determine ratings deviation

The new Ratings Deviation (

RD

) is found using the old Ratings Deviation (

RD_0

RD = \min\left(\sqrt,350\right)

where

t

is the amount of time (rating periods) since the last competition and '350' is assumed to be the RD of an unrated player. If several games have occurred within one rating period, the method treats them as having happened simultaneously. The rating period may be as long as several months or as short as a few minutes, according to how frequently games are arranged. The constant

c

is based on the uncertainty of a player's skill over a certain amount of time. It can be derived from thorough data analysis, or estimated by considering the length of time that would have to pass before a player's rating deviation would grow to that of an unrated player. If it is assumed that it would take 100 rating periods for a player's rating deviation to return to an initial uncertainty of 350, and a typical player has a rating deviation of 50 then the constant can be found by solving

350 = \sqrt

for

c

. Or

c = \sqrt \approx 34.6

Step 2: Determine new rating

The new ratings, after a series of m games, are determined by the following equation:

r = r_0 + \frac\sum_^

where:

g(RD_i) = \frac

E(s, r_0,r_i,RD_i) = \frac

q = \frac = 0.00575646273

d^2 = \frac

r_i

represents the ratings of the individual opponents.

RD_i

represents the rating deviations of the individual opponents.

s_i

represents the outcome of the individual games. A win is 1, a draw is

\frac

, and a loss is 0.

Step 3: Determine new ratings deviation

The function of the prior RD calculation was to increase the RD appropriately to account for the increasing uncertainty in a player's skill level during a period of non-observation by the model. Now, the RD is updated (decreased) after the series of games:

RD'=\sqrt

Glicko-2 algorithm

Glicko-2 works in a similar way to the original Glicko algorithm, with the addition of a rating volatility

\sigma

which measures the degree of expected fluctuation in a player’s rating, based on how erratic the player's performances are. For instance, a player's rating volatility would be low when they performed at a consistent level, and would increase if they had exceptionally strong results after that period of consistency. A simplified explanation of the Glicko-2 algorithm is presented below:

Step 1: Compute ancillary quantities

Across one rating period, a player with a current rating

\mu

and ratings deviation

\phi

plays against

m

opponents, with ratings

\mu_1,...,\mu_m

and RDs

\phi_1,...,\phi_m

, resulting in scores

s_1,...,s_m

. We first need to compute the ancillary quantities

v

and

\Delta

v = \left \sum_^m g(\phi_j)^2 E(\mu,\mu_j,\phi_j) \ \right

\Delta = v \sum_^m g(\phi_j) \

where

g(\phi) = \frac,

E(\mu,\mu_j,\phi_j) = \frac.

Step 2: Determine new rating volatility

We then need to choose a small constant

\tau

which constrains the volatility over time, for instance

\tau=0.2

(smaller values of

\tau

prevent dramatic rating changes after upset results). Then, for

f(x) = \frac\frac - \frac,

we need to find the value

A

which satisfies

f(A)=0

. An efficient way of solving this would be to use the Illinois algorithm, a modified version of the '' regula falsi'' procedure (see for details on how this would be done). Once this iterative procedure is complete, we set the new rating volatility

\sigma'

\sigma' = \exp\.

Step 3: Determine new ratings deviation and rating

We then get the new RD

\phi' = 1\Big/\sqrt,

and new rating

\mu' = \mu + \phi'^2 \sum_^m g(\phi_j) \.

These ratings and RDs are on a different scale than in the original Glicko algorithm, and would need to be converted to properly compare the two.

References

External links

Professor Glickman's Glicko-Website
*

TrueSkill TrueSkill is a skill-based ranking system developed by Microsoft for use with video game matchmaking on Xbox Live. Unlike the popular Elo rating system, which was initially designed for chess, TrueSkill is designed to support games with more than t ...

br>
rating system by Microsoft borrows many ideas of Glicko.
forwardloop/glicko2s
Glicko-2 implementation for the JVM
RobKohr/glicko
JavaScript Glicko-2 implementation.
mmai/glicko2js
Client side javascript and node.js Glicko-2 implementation
deepy/glicko2
Python Glicko-2 implementation.
sublee/glicko2
Python Glicko-2 implementation.
PlayerRatings
R Glicko implementation by Alec Stephenson and Jeff Sonas.
scala-glicko2
Scala Glicko-2 implementation.
dimos/glicko2
Glicko-2 implementation for Scala and Scala.js {{Sports rating systems Chess in Australia Chess rating systems